13 Multiple Variables

1 Joint Distribution

Now we want to explore multiple variables and their interaction. Here we focus on bivariate jointly continuous RVs.

Jointly Continuous

(X,Y) are jointly continuous if a function with joint density function f(x,y), s.t.P((X,Y)B)=Bf(x,y)dxdy,
For BR2 measurable.
So fX,Y(x,y)=limΔ(x,y)P((x,y)Δ)Area(Δ).

A joint density function satisfies

In the one dimensional case, for continuous RV, we have the approximation P(X[x,x+ε))fX(x)ε.
Similarly in two dimentional case, take a small neighborhood Δ containing (x,y), then we have P((X,Y)Δ)fX,YArea(Δ).

How to recover the marginal density fX or fY given fX,Y?

P(Xt)=tfX,Y(x,y)dydx=tfX(x)dx.
Fact

fX(x)=RfX,Y(x,y)dy=RfX,Y(x,y)dx.

X=dW,Y=dZ will not lead to (X,Y)=d(W,Z): let X,W,Vi.i.dExp(1), and ZGamma(2,1). Let Y=X+V. By this result we know YdGamma(2,1). Then X=dW,Y=dZ. However, we always have XY, and W>Z has positive probability, so (X,Y)d(W,Z).

1.1 Independence

X,Y are independent if fX,Y(x,y)=fX(x)fY(y),(x,y)R2.
By independence, the value of X will not give us anything about Y.

2 Bivariate Transformation

Transformation of random variables: T:R2R2,(X,Y)(W,Z).
Polar coordinates: (x,y)(r,θ),x=rcosθ,y=rsinθ.

Fact (Polar Coordinates)

fR,Θ(r,θ)=rfX,Y(rcosθ,rsinθ).

Linear transformation

T:R2R2 is a linear transformation if T(xy)=MT(xy)+PT.MTR2×2,PTR2×1.

Some properties:

Let T be a linear transformation with inverse S. Given the joint p.d.f of (X,Y), what's the joint p.d.f of (W,Z)=T(X,Y)?
On one hand, P((W,Z)P)fW,Z(w,z)Area(P).
Similarly P((W,Z)P)=P(T(X,Y)P)=P((X,Y)S(P))fX,Y(x,y)Area(S(P))=fX,Y(S(w,z))Area(P)|det(MS)|.
We conclude that for invertible transformation T,

fW,Z(w,z)=fX,Y(S(w,z))|det(MS)|.
Orthogonal transformation

T is an orthogonal transformation if it preserves the inner product: v,w=Tv,Tw. I.e.: PT=0, MT is an orthogonal matrix, MT1=MTT.

They preserve angles, lengths, Areas, det(MT)=±1.

Fact

For R2, all orthogonal transformations are rotation, reflection, and composition of the two.

3 Invertible Affine Transformation

Suppose T:R2R2,T(X,Y)=(U,V) has inverse S(U,V)=(X,Y).
Pasted image 20241201150847.png|400
Define linear translation T(X,Y)=MT[XY]+PT=[UV], where MT is 2×2 invertible matrix, and PT is 2×1 vector.
Since ST(X,Y)=S(U,V)=(X,Y), S(U,V)=MT1[UV]MT1PT=MS[UV]+PS.
Since P((U,V)B)=P((X,Y)S(B))BfU,V(u,v)dudv=S(B)fX,Y(x,y)dxdy, we have (3.1)fU,V(u,v)=fX,Y(S(u,v))|detMS|.

4 General Invertible Transformations

T:R2R2. Assume differentiable, but not necessarily affine. Also assume T(X,Y)=(U,V), S(U,V)=(X,Y), ST(X,Y)=(X,Y). So P((U,V)Δ)=P((X,Y)S(Δ))fU,V(u,v)Area(Δ)fX,Y(S(u,v))Area(S(Δ)).
We want to know S(Δ).
If S is affine, then S(Δ) is a parallelogram. For general S, if δ,ε1, then S(Δ) can be approximated by a parallelogram, since S can be approximated by an affine transformation on Δ.
Let S(uv)=[S1(u,v)S2(u,v)]R2, where Si:R2R,i=1,2 are differentiable functions. Then for any point (a,b) near (u,v), the Taylor expansion in second order gives Si(a,b)Si(u,v)+(au)Siu(u,v)+(bv)Siv(u,v).
In matrix notation, S[ab][S1(u,v)S2(u,v)]+[S1u(u,v)S1v(u,v)S2u(u,v)S2v(u,v)][aubv]. This is an affine transformation. Denote the yellow matrix as JS(u,v), which is the Jacobian matrix of S at (u,v).

Hence, P((X,Y)S(Δ))fX,Y(S(u,v))|detJS(u,v)|Area(Δ), so (4.1)fU,V(u,v)=fX,Y(S(u,v))|detJS(u,v)|.

In summary,